Towards Constructing Sports News from Live Text Commentary

Authors

  • Jianmin Zhang
  • Jin-ge Yao
  • Xiaojun Wan
Abstract

In this paper, we investigate the possibility of automatically generating sports news from live text commentary scripts. As a preliminary study, we treat this task as a special kind of document summarization based on sentence extraction. We formulate the task in a supervised learning-to-rank framework, using both traditional sentence features for generic document summarization and newly designed task-specific features. To tackle the problem of local redundancy, we also propose a probabilistic sentence selection algorithm. Experiments on data we collected from football live commentary scripts and the corresponding sports news demonstrate the feasibility of this task. Evaluation results show that our methods are appropriate for this task, outperforming several baseline methods in different aspects.
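
The abstract does not spell out the selection algorithm, so the following is only a minimal illustrative sketch of the general idea: given per-sentence relevance scores (here assumed to come from some trained ranker), sentences are sampled with probability proportional to their score, discounted by their overlap with sentences already chosen. The function names `jaccard` and `select_sentences`, the word-overlap similarity, and the sampling rule are all assumptions made for illustration, not the authors' method.

```python
import random


def jaccard(a, b):
    """Word-overlap similarity between two sentences (0.0 to 1.0)."""
    sa, sb = set(a.lower().split()), set(b.lower().split())
    if not sa or not sb:
        return 0.0
    return len(sa & sb) / len(sa | sb)


def select_sentences(sentences, scores, k, seed=0):
    """Sample up to k sentence indices, discouraging redundant picks.

    Hypothetical rule: weight = score * (1 - max similarity to chosen).
    Returns the chosen indices in their original (chronological) order.
    """
    rng = random.Random(seed)
    chosen = []
    remaining = list(range(len(sentences)))
    while remaining and len(chosen) < k:
        weights = []
        for i in remaining:
            # Highest overlap with any sentence already selected.
            redundancy = max(
                (jaccard(sentences[i], sentences[j]) for j in chosen),
                default=0.0,
            )
            # Small constant keeps the total weight strictly positive.
            weights.append(max(scores[i], 0.0) * (1.0 - redundancy) + 1e-9)
        pick = rng.choices(remaining, weights=weights, k=1)[0]
        chosen.append(pick)
        remaining.remove(pick)
    return sorted(chosen)


if __name__ == "__main__":
    # Made-up commentary lines and made-up ranker scores.
    commentary = [
        "Kick-off at the stadium",
        "Shot from distance goes wide",
        "Goal scored from the penalty spot",
        "Goal scored from the penalty spot after a foul",
        "Full time whistle",
    ]
    ranker_scores = [0.2, 0.4, 0.9, 0.85, 0.5]
    for idx in select_sentences(commentary, ranker_scores, k=3):
        print(commentary[idx])
```

In this sketch the two near-identical penalty lines are unlikely to both be selected, because choosing one sharply lowers the sampling weight of the other.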


Similar resources

Content Selection for Real-time Sports News Construction from Commentary Texts

We study the task of constructing sports news reports automatically from live commentary and focus on content selection. Rather than receiving every piece of text of a sports match before news construction, as in previous related work, we verify the feasibility of a more challenging setting: generating news reports on the fly by treating live text input as a stream. We design scoring func...


Sports News Generation from Live Webcast Scripts Based on Rules and Templates

With the dramatic increase in live webcast scripts about sports, there is an urgent demand to write and publish a sports news article immediately after a sports game. However, so far, sports news articles are usually written by human experts or journalists, and the manual writing of sports news is time-consuming and inefficient. This paper describes our system on the sports news generation ...


39. Opinion mining and sentiment analysis

Opinions are ubiquitous in text, and readers of on-line text — from consumers to sports fans to news addicts to governments — can benefit from automatic methods that synthesise useful opinion-orientated information from the sea of data. In this chapter on opinion mining and sentiment analysis, we introduce an idealised, end-to-end opinion analysis system and describe its components, including c...


NUS at WMT09: Domain Adaptation Experiments for English-Spanish Machine Translation of News Commentary Text

We describe the system developed by the team of the National University of Singapore for English to Spanish machine translation of News Commentary text for the WMT09 Shared Translation Task. Our approach is based on domain adaptation, combining a small in-domain News Commentary bi-text and a large out-of-domain one from the Europarl corpus, from which we built and combined two separate phrase t...


Overview of the NLPCC-ICCPOL 2016 Shared Task: Sports News Generation from Live Webcast Scripts

Live webcast scripts are valuable resources for describing the process of sports games. This shared task aims to automatically generate sports news articles from live webcast scripts. The task can be considered a special case of single document summarization. In this overview paper, we will introduce the task, the evaluation dataset, the participating teams and the evaluation results. The datas...




Publication date: 2016